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Abstract 

The capacity regions of semideterministic multiuser channels, such as the semideterministic relay channel and the multiple 
access channel with partially cribbing encoders, have been characterized using the idea of partial-decode-forward. However, the 
requirement to explicitly decode part of the message at intermediate nodes can be restrictive in some settings; for example, when 
nodes have different side information regarding the state of the channel. In this paper, we generalize this scheme to cooperative- 
bin-forward by building on the observation that explicit recovering of part of the message is not needed to induce cooperation. 
Instead, encoders can bin their received signals and cooperatively forward the bin index to the decoder. The main advantage of 
this new scheme is illustrated by considering state-dependent extensions of the aforementioned semideterministic setups. While 
partial-decode-forward is not applicable in these new setups, cooperative-bin-forward continues to achieve capacity. 

Index Terms 

Cooperative-bin-forward, Cribbing, Multiple-access channel. Relay channel. State, Semideterministic 


I. Introduction 

The capacity region of the semideterministic relay channel, depicted in Figure [T] is characterized in |[T] using the partial- 
decode-forward scheme. In this scheme, the source splits its message into two parts and encodes them using superposition 
coding. The relay decodes one part of the message, and maps this to a codeword to be transmitted in the next block. The 
codebooks at the source are generated conditioned on the relay’s transmission, which results in coherent transmissions from 
the source and the relay. 

Consider now the extension of this model depicted in Figure which corresponds to a state-dependent semideterministic 
relay channel where the state information is causally available only at the source and the destination. This model captures the 
natural cellular downlink scenario, in which training enables the source and the destination to learn the channel gain between 
them (state = channel gain), while a relay could be potentially available to assist the communication, e.g. a wifi access point. 
In this scenario, it is typically unrealistic to assume that the relay is also able to obtain timely information about the channel 
state between the source and the destination. In this case, requiring the relay to still decode part of the source message, without 
any state information, is unduly restrictive and to our knowledge the capacity remains unknown to date. 

The main contribution of this paper is to develop a new scheme which we call cooperative-bin-forward. This new scheme 
does not require the relay to decode part of the message; instead, the relay simply bins its received signal and maps the bin-index 
to a codeword to be transmitted in the next block. As in partial-decode-forward, the codebooks at the source are generated 
conditioned on the relay’s transmission, resulting in coherent cooperation. This cooperative aspect of the scheme distinguishes 
it from bin-forward (a.k.a. hash-forward) that has been considered previously for primitive relay channels in Q. For the vanilla 
semideterministic relay channel in Figure cooperative-bin-forward recovers the capacity achieved by partial-decode-forward. 
However, while partial-decode-forward is not applicable for the state-dependent semideterministic relay channel in Figure 
we show that cooperative-bin-forward continues to achieve capacity. 



Fig. 1: Semideterministic Relay Channel 
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Fig. 2: State-dependent Semideterministic Relay Channel with Causal State Information at Source and Destination 


We next consider another setup where partial-decode-forward is known to be capacity achieving, the multiple-access channel 
(MAC) with strictly causal partial cribbing encoders, depicted in Figure The MAC with cribbing has been introduced by 
Willems in 0 and its generalization to partial cribbing has been studied in 0. Compared to the canonical MAC, transmitters 
here can overhear each other’s transmissions while simultaneously transmitting their own data. This possibility, increasingly 
enabled today by the development of full-duplex radios, is especially appealing since such overheard information can be 
exploited to induce cooperation among the transmitters by exploiting the natural broadcast nature of the wireless medium without 
requiring any dedicated resources. Partial cribbing refers to the assumption that the overheard signal is some deterministic 
function of the signal transmitted by the other transmitter, which allows to capture the signal degradation in the cribbing link 
via a simple model|^The MAC with partial cribbing can be regarded as a generalization of the semideterministic relay channel 
in the sense that when one of the transmitters does not have a message and does not have an outgoing cribbing link, the former 
reduces to an instance of the latter. 
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Fig. 3: Multiple Access Channel with Strictly-Causal Partial-Cribbing Encoders 


As in the semideterministic relay channel, partial-decode-forward achieves the capacity r egio n of the MAC with partial 
cribbing 0. Here, we consider a natural extension of this setup with states, depicted in Figure 2^ The most prevalent example 
of a situation captured by this model is cellular uplink. In cellular communication, training enables a transmitter to learn the 
channel gain between itself and the receiver, but assuming knowledge of the channel between any other transmitter and the 
receiver is unrealistic. Hence, the model in Figure includes a state composed of two components, each known causally only 
to the corresponding transmitter. These two components are not necessarily independent. Since encoders do not share common 
state information, partial-decode-forward becomes too restrictive for this setting. Instead, an achievability scheme based on 
cooperative-bin-forward provides the capacity region. 

Finally, motivated by the relay-without-delay channel considered in 0, we consider “without-delay” variations of the two 
state-dependent setups described above, that are depicted in Figure and Figure respectively. In these setups, the strict 
causality of one of the links is replaced by causality. In the former, which is the state-dependent semideterministic relay- 
without-delay channel, the transmission of the relay is allowed to depend on its past and current received signal. The capacity 
region for this setup without state is characterized in 0, using partial-decode-forward combined with instantaneous relaying 
(a.k.a. codetrees or Shannon strategies). The latter is a state-dependent multiple access channel with one strictly causal and one 
causal partial cribbing link. The capacity region for this setup without states is characterized in 0, again using partial-decode- 
forward combined with instantaneous relaying. We show that cooperative-bin-forward combined with instantaneous relaying 
achieves the capacity regions of these setups too, while partial-decode-forward suffers from the same shortcoming encountered 
for the previous two setups. 


'Earlier work W has observed that even if a very coarsely quantized signal is overheard, it can still be sufficient to achieve rates that are close to the 
best rates achievable with the unrealistic perfect cribbing (overhearing via a noiseless link). Thus, we can in fact manually perform coarse quantization of 
the overheard signals to simplify operations without a significant loss in performance, while the coarseness simultaneously lends justification to modeling the 
output of the noisy overhearing channel as a deterministic function of the input. 

^This setup is more general than the setup in Figure]^ in the sense that there are two messages and two states, however it is also special in the sense that 
the partial cribbing links are of the form zi(Xii, Sn) and Z 2 {X 2 i, S 2 i), instead of zi(Xu, X 2 i, Su) and Z 2 {Xii, X 2 i, S 2 i)- 
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Fig. 4: State-dependent Multiple-Access Channel with Strictly-Causal Partial-Cribbing Encoders 



Fig. 5: State-dependent Semideterministic Relay-Without-Delay Channel with Causal State Information at Source and Destination. For this to make sense, we 
need to define the received signal at the relay so that it does not depend on the current transmission of the relay, in contrast to Figure 
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Fig. 6: State-dependent Multiple-Access Channel with Partially-Cribbing Encoders, one strictly causal and the other causal. Note that both links cannot be 
changed to causal. 


Related Work 

We describe here some multiuser setups considered in literature that involve state-dependent channels and/or some form of 
cooperation, and how they relate to the setups considered in this paper. 

Various cases of state-dependent relay channels have been considered in 0-II3- The achievability schemes in these 
works combine well-known block-Markov relaying ideas such as partial-decode-forward and compress-forward with Shannon 
strategies or (Gelfand-Pinsker) multicoding. A class of state-dependent orthogonal relay channels with state information only 
at decoder was considered in GD’ and optimality of a partial-decode-compress-forward scheme was proved. To the best of our 
knowledge, the state-dependent relay channels considered in this paper have not been previously studied, and as mentioned in 
the introduction, standard combinations of available ideas are not sufficient to obtain good achievability schemes. 

There has been interesting recent work on state-dependent multiple access channels where the state is only known to the 
encoders. The encoders are not allowed to cooperate in these setups, so the main challenge is to handle the lack of state 
information at the destination. When the state is known in a strictly causal manner, it was shown in that in contrast 

to point-to-point channels, ignoring the state information is suboptimal. An improvement in achievable rates can be obtained 
by explicitly communicating the stale state information to the destination. The aforementioned papers accomplished this using 
block-Markov schemes that encode messages of the current block as well as some information about the state and messages 
from previous block. When the state information is known noncausally, the work GZl considered the dirty-paper special case 
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(additive interference composed of two components each known noncausally to one and only one encoder in Gaussian noise). 
For this case, a straightforward extension of Gelfand-Pinsker coding turns out to be highly suboptimal. Instead, a structured 
form of Gelfand-Pinsker coding using lattices is useful for achieving high rates, since it ensures that the overall interference at 
the destination concentrates on a small set. Finally, the case of common causal state information at the encoders was studied 
in which provided an inner bound using Shannon strategies. Recall that the multiple access channels studied in this paper. 
Figure and Figure assume also that the destination has full state information, while the encoders cooperate via partial 
cribbing, thus the main challenge in this paper is to optimally establish cooperation via cribbing among encoders that have 
disparate state information, rather than handling the lack of state information at the destination. 

A few works have considered state-dependent multiple access channels where the state information is available at the 
encoders as well as the decoder, also with no cooperation between the encoders. It was shown in Gg-iiD that optimal rates 
can be achieved by effectively treating the state components as time-sharing. These setups can be obtained as a special case of 
the setup in Figure by setting the partial cribbing links to zero, though the optimal schemes presented previously for these 
special cases do not provide the necessary insights for establishing cooperation among the encoders. In particular, the causality 
of cribbing requires the scheme to be block-Markov, and the encoding operation across blocks needs to be such that the two 
state components can be effectively treated as time-sharing, on top of establishing cooperation inspite of the disparateness of 
the state components. 

Cooperation in multiple access channels was studied by Willems in p2) and 0, wherein he introduced the notions of 
conferencing (orthogonal links) and cribbing respectively. In conferencing, dedicated orthogonal links are introduced for 
cooperation. Cribbing, in contrast, does not assume dedicated resources for cooperation. For example, cribbing can be thought 
of as exploiting the natural broadcasting nature of the wireless medium for cooperation. Decode-forward based schemes were 
proved to be optimal for multiple access channels with cribbing. However, the cribbing in Q was assumed to be perfect 
(noiseless). To account for the fact that perfect cribbing is unrealistic, the notion of partial cribbing, as described in the 
introduction, was studied in 0. and a partial-decode-forward based scheme was shown to be optimal. As mentioned in 
the introduction, the fact that a part of the message needs to be explicitly decoded in partial-decode-forward, renders its 
straightforward extension inapplicable for our purpose. 

There has also been interest in studying multiple access channels that include states together with some form of cooperation 
between the encoders, under various assumptions on the state information availability and the form of cooperation @-1^. 
All these works assume that whenever the state information is available, it is available noncausally. The capacity regions for 
noncausal state information only at one encoder were provided in p3) and | |2^ . The former additionally needed to assume 
that the informed encoder also knows the other message, while the latter assumed instead that there is a strictly causal or 
causal perfect cribbing link from the uninformed encoder to the informed encoder. The achievability schemes in both works 
used Gelfand-Pinsker multicoding at the informed encoder conditioned on the additional information received in the form of 
message cognition or cribbing. The capacity region for the case of conferencing encoders when noncausal state information is 
available at all nodes, including the destination, was provided in | [25) using the idea of double-binning. Achievable rate regions 
were derived in |26| and p7) , where the former considered perfect cribbing among encoders with noncausal state information 
only at the encoders, while the latter replaced perfect cribbing by noisy cribbing. 


Organization 

The following section describes the models and notation. Section 11^ contains the formal statements of the main results 
described in the introduction. A toy example is considered in Section |IV| for the purpose of explicitly illustrating the advantage 
of cooperative-bin-forward over partial-decode-forward. The following sections contain the proofs of the main results. We 
conclude by describing some open problems in section 

II. System Models 

As standard, capital letters denote random variables, small letters denote realizations, and calligraphic letters denote the 
alphabet of the corresponding random variable. The notation stands for the e-strongly typical set of sequences for the 
random variables in context. 


A. State-Dependent Semideterministic Relay Channels 

The state-dependent semideterministic relay channel is depicted in Figure and described by the pmf 
Ps{s)pY\x,Xr,s{y\xTXrT s) and Z — z{X, S). The encoder and decoder have causal state information. So a (n, 2"^,e) 
code for the above channel consists of the source encoding, relay encoding and decoding functions: 

x^:[l: 2”^] X 5* ^ A", l<i<n, 

Xr,i ■ —)• Xr, I < i < n, 

m-.y^ xS^ ^[1: 2 "'^], 
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such that 

Pr{TO(y",S'") ^ M} < e, 

where M G [1 : 2"^] denotes the transmitted message. A rate R is said to be achievable if for every e > 0, there exists a 
(n,2"^,e) code for sufficiently large n. The capacity is defined to be the supremum of achievable rates. 

The state-dependent semideterministic relay-without-delay channel is depicted in Figure and described by the pmf 
Psis)pY\x,Xr,siy\^i^r, s) and Z = z{X,S). The difference from the previous setup is that the relay encoding function 
is now allowed to depend also on Zp 

Xr,i '■ Z'' ^ Xr, 1 < i < n. 

Note that here we need to restrict Z to be z{X, S), instead of z{X, Xr, S). 

B. State-Dependent Multiple-Access Channels 

The state-dependent multiple access channel with strictly-causal partial-cribbing encoders is depicted in Figure and 
described by the pmf PSi,S 2 {si, S 2 )PY\Xi,X 2 ,Si,S 2 {y\^i^ ^ 2 , Si, S 2 ) and Zi = and Z 2 = Z 2 {X 2 ,S 2 ). The encoders 

have causal knowledge of the corresponding state components, but no knowledge of the other state component. The decoder 
is assumed to know both the state components. A (n, 2"^i, 2"^^^, e) code for the above channel consists of the encoding and 
decoding functions: 

x^i : [1 : 2”-^i] X X ^ Xi, 1 < i < n, 

X 2 ,i : [1 : 2”-^"] X 5^ X ^ X 2 , 1 < i < n, 

TOi : 3^” X X 5^ ^ [1 : 2”^i], 

TO2 : 3^” X X ^ [1 : 2”^^], 

such that 

Pi{(mi(Y-,S^,S^),m2(Y^,S^,S^)) ^ (Mi,M2)} < e, 

where Mi G [1 : 2”^^] and M 2 G [1 : 2"^^] denote the transmitted messages. A rate pair (i?i, R 2 ) is said to be achievable if 
for every e > 0, there exists a (n, 2”^\ 2 "^=^ code for sufficiently large n. The capacity region is defined to be the closure 
of the achievable rate region. 

The “without-delay” variation of this setup, also referred to as causal cribbing, is depicted in Figure where one of 
the partial cribbing links is changed from strictly causal to causal. So, the only difference from the previous setting is that 
X 2 i{M 2 ,S 2 , z\~^) is replaced by X 2 i{M 2 , ZJ). 


III. Main Results 

In this section, we describe the capacity regions for all the setups described in the previous section. The proofs are presented 
in subsequent sections. The achievability parts of all the theorems are accomplished by building on the idea of cooperative- 
bin-forward. 

The first result is taken from Q. We restate it here and provide a proof of the achievability in Section [V] using cooperative- 
bin-forward. Due to the simplicity of the setup, it serves well to bring out the main idea of the new scheme, before we describe 
results for the more complicated setups. 

Theorem 1. The capacity of the semideterministic relay channel, shown in Figure [T] is given by 

C= max vxmv{I{X,Xr]Y), H{Z\Xr)+I{X-Y\Xr,Z)}. (1) 

PX,Xri^<^r) 

The next result provides an expression for the capacity of the state-dependent semideterministic relay channel. 

Theorem 2. The capacity of the state-dependent semideterministic relay channel, shown in Figure is given by 

C= max miii{I{X,Xr]Y\S), H{Z\S,Xr) + IiX;Y\S,Xr,Z)}. (2) 

Px,. {X'r}Px\Xr.,si^\^r-,s) 

One difference between the capacity expressions of Theorem [T] and Theorem is that the mutual information and entropy 
terms involve a conditioning on S. Such an expression would also characterize the capacity if the relay is provided with the 
state information, and it would be achievable by performing partial-decode-forward while treating the state as a time-sharing 
sequence. It is quite interesting then that the capacity expression remains the same even when the relay does not have state 
information. However, the limitation is reflected in the fact that the choice of pmf is restricted to be PXr{Xr)Px\x,.,s{x\Xr, s), 
instead of Px,Xr\s{x,Xr\s). So, the cost of not having state information at the relay is reflected entirely in the limited choice 
of pmf 
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The following theorem states the capacity of the without-delay variation of the above case. The expression involves an 
auxiliary random variable, which allows the relay to perform instantaneous relaying on top of the binning. This can achieve 
maximal source-relay cooperation, as conveyed by the following theorem. 

Theorem 3. The capacity of the state-dependent semideterministic relay-without-delay channel, shown in Figure is given 
by 

C= max min{IiU,X-,Y\S),H{Z\U,S)+IiX;Y\U,Z,S)}, (3) 

PumPxlU,s(^\'>J-,S),Xr=Xr{U,Z) 

where \U\ < \S\ {\X\\Xr \ - 1) + 2. 

The capacity region for the setup of Theorem]^ in the absence of states is characterized in ||^ Proposition 7]. Setting S to 
be the empty random variable in Theorem recovers this result. Note that the objective in (|^ is the same as that in (|^ with 
Xr being replaced by U. However, the optimization in ([^ is over a different domain since the dependence of Xr on Z can 
now be chosen and is not specified by the channel. 

The next two theorems describe the capacity regions for the two multiple-access setups. 


Theorem 4. The capacity region of the state-dependent multiple-access channel with partially cribbing encoders, shown in 
Figure 1^ is given by the set of rate pairs satisfying 

Ri < I{Xi;Y\U, X2, Zi,SuS2) + H{Zi\U, Si), 
i?2 < IiX2; Y\U, Xi,Z2, Si,S2) + H{Z2\U, S 2 ), 
RiYR2<I{Xi,X2-,Y\U,Zi,Z2,Si,S2)+H{Zi,Z2\U,Si,S2), 
RiYR2<I{Xi,X2-,Y\Si,S2), 


for pmf of the form 

Pu{u)px-,\u,St i.xi\u,si)px.,\u,s^{x2\u, S2), 

with = zi{Xi,Si) and Z 2 = ^ 2 (^ 2 , ^ 2 ), with \U\ < min{|5i||52|(|3^| - 1) + 4, |5i||52|(|A’i||<T2| - 1) + 3}. 


Remark: It can be shown that the set described in the above theorem is convex, so there is no need to introduce an additional 
auxiliary random variable for time-sharing. 

As described earlier, the special case of no cribbing (obtained by setting Zi = Q and Z 2 = 0) has been considered in m- 
m- For this case, the last inequality becomes redundant and setting the auxiliary random variable U to be the time-sharing 
random variable in the statement of the above theorem is optimal. The resulting region recovers the results in the aforementioned 
papers. For the other extreme of constant states, i.e. = 0 and S 2 = 0, the capacity region in the above theorem recovers 
the result for strictly causal partial cribbing from Q. 

Theorem 5. The capacity region of the state-dependent multiple-access channel with partially cribbing encoders in the presence 
of a causal cribbing link, shown in Figure]^ is given by the set of rate pairs (i?i, R 2 ) satisfying 

i?i < I{Xi;Y\U, X 2 , Zi,Si,S2) + H{Zi\U, Si), 

R2<I{X2-,Y\U,Xi,Z2,Si,S2) + H{Z2\U,S2,Zi), 
Ri+R2<I{Xi,X2-,Y\U,Zi,Z2,Si,S2)YH{Zi,Z2\U,Si,S2), ^ ’ 

Ri+R2<I{Xi,X2-,Y\Si,S2), 


for pmf of the form 


Pc/ (mWi ic/.Si (a;i |u, Si )PX 2 1 (7.52.Zi (a;2 |u, S2, 21 ), 


with Zi = zi(Xi,^i) and Z 2 = 22 (^ 2 , ^ 2 ), with \U\ < min{|5i||52|(|3^| - 1) + 4, |5i||52|(|A’i||A’2| - 1) + 3}. 

Note that if Px-i\u,S 2 ,Zi{x 2 \u, S 2 , zi) in Theorem |^is replaced by Px 2 |c/,S 2 (^ 2 |'ti, S 2 ), then the region becomes identical to 
that in Theorem]^ Setting Si and S 2 to be constant retrieves the result for causal partial cribbing from Q. 


IV. Illustrative Example 

Consider the following special case of Figure 
Let the state S be the ternary random variable 


[p/2, 

if s = 0 

Ps{s) = < p/2, 

if s = 1 

[1 - P, 

if s = 2 
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where p < 1/2. The other variables are all binary. The channel z{X, S) is the memory with stuck-at faults channel considered 
in 128 Figure 7.7], while the channel PY\x,Xr-,S is specialized to be a noiseless channel from Xr to Y. Formally, 


fO, if S' = 0, 
z(X,S)=il, ifS=l, 
[x, its = 2, 

Y = Xr. 


Recall that the source and the destination know the state information causally while the relay has no state information. 

If the relay is required to decode the message, motivated by the optimality of decode-forward in the case of a line network 
with no state, the achievable rate is limited to be no more than the capacity of the memory with stuck-at faults channel when 
the state is known causally only to the source, which is 1 — 772 (|) • We point out that this cannot be improved by using 
partial-decode-forward, because the absence of a direct link between the source and destination means that any part of the 
message that is not decoded by the relay cannot be communicated to the destination in any manner. However, a higher rate 
can be achieved if the relay simply forwards its received signal, resulting in an effective channel between the source and the 
destination that is the memory with stuck-at faults channel with state known causally both to the source and the destination. 
The capacity of this channel is 1 — p, which is achieved by multiplexing at the source and demultiplexing at the destination 
according to the observed state. Thus, a rate 1 — p which is higher than 1 — H 2 (|) can be achieved. 

What if the channel from the relay to destination is not a noiseless bit-pipe, but a general noisy channel with capacity at 
least 1 — p? The rate 1 — p can still be achieved if the operation at the relay is changed from simply forwarding to randomly 
binning its received signal into « bins and forwarding a codeword corresponding to the chosen bin. To recover the 

message, the destination can first decode the bin-index. Since the destination has state information, it can reconstruct the 
state-multiplexed codebook at the source. Hence, it can recover the message by finding the unique source codeword, if any, 
that results in the received signal at the relay falling in the correct bin. 

The above example serves to illustrate the limitation of partial-decode-forward when nodes have different side-information. 
This example did not require cooperative transmissions from the source and the relay, because the source transmission did not 
directly affect the received signal at the destination. When there is also a direct link between the source and the destination, as 
allowed in the general models that we consider in this paper, the source and relay need to perform the bin-forward operation 
in a cooperative fashion. 


V. Proof of Theorem[T] 

We demonstrate in this section the achievability of capacity for the semideterministic relay channel using the new scheme 
cooperative-bin-forward. As described in the introduction, this scheme does not require the relay to decode part of the message. 
Instead, the relay simply bins its received signal and maps the bin-index to a codeword to be transmitted in the next block. 
As in partial-decode-forward, the codebooks at the source are generated conditioned on the relay’s transmission, resulting in 
coherent cooperation. The scheme is formally described next. 

Proof: 

Fix a pmf px,Xrix, Xr) and e > 0. Split R as R' + R", with the message M denoted accordingly as (M/ M"). Divide the 
total communication time into B blocks, each of length n. 

Codebook Generation: 

For each block 6 G [1 : B], a codebook is generated independently of the other blocks as follows. 

- Cooperation codewords 

Generate 2""^ codewords a;/{,(4-i) i-i.d. according to pxr^ where Z^-i S [1 : 2"’^]. 

- Cribbed codeword^ 

For each 4-i, generate 2”^ codewords 2 ^(toj|4-i) according to n”= iPz\Xr{-\xrbi{lb-i)), where mj, G [1 : 2^^ ]. 

- Transmission codewords 

For each 4-i and each 777 /, generate 2”'^ codewords x^{m'l\lb-i,m'f), where m/ G [1 : 2"^ ] according to 

]Xl=iPx\Xr,z{-\xru{h-i),zu{m'^\lb-i)). 

- Binning 

Partition the set of all Z” into 2”^ bins, by choosing a bin for each z” independently and uniformly at random. Denote 
the index of the chosen bin for z” by binf,(z"). 


^Given the analogy of the source-to-relay link in the relay channel with the cribbing link in the multiple-access channel with cribbing encoders, we call 
the codewords as cribbed codewords. 







Encoding: 

Fix Iq — 1 and = (1,1). Since the message in the last block is fixed, the effective rate of communication is 

which can be made as close as desired to R by choosing a sufficiently large B. 

In block b, It-i is known to the source encoder. To communicate message mt = it transmits 

The relay transmits x"j(4-i). Due to the deterministic link from source to relay and the codebook construction, the received 
signal at the relay in block b is the codeword The source and the relay set lb to be the index of the bin containing 

Zbim'b\lb-i)- 

Decoding: 

The decoder performs backward decoding, starting from block B and moving towards block 1, performing the following 
two steps for each block b: 

(1) Assuming that lb is known from previous operations, the decoder, for each Ib-i G [1 : 2"^], finds the unique such 
that 

binf,{z^(m't,|Z6_i)) = k. 

Whenever a unique to), cannot be found for some 4-i, the decoder chooses any to), arbitrarily. So after this operation, 
the decoder has chosen one to), for each Ib-i, given its knowledge of lb- We will signify this explicitly by denoting the 
chosen message as to),(4-i,4). 

(2) Now the decoder looks for the unique {lb-i,m'^) such that 

(x)?,(4_i), z^im'b{lb-iJb)\ib-i) , x^(to;'|4_i,toU4-i,4)) , y?) G (6) 

Note that the first step does not depend on the received signal in block b at the destination. However, it depends on the 
received signal in block & + 1, due to the involvement of 4. 

Probability of Error: 

The following error analysis reveals that in order to achieve the highest rate, the scheme will set i?' « i? « H{Z\Xr). It 
is easy to see that when R' « H{Z\Xr), given its knowledge of 4-i, the relay can indeed recover to),, even though it is 
not required to do so in this new scheme. In other words, for a given Ib-i, since R' « R, each message to), is mapped to a 
different bin, and therefore cooperatively communicating the bin index is indeed equivalent to cooperatively communicating 
the partial message to),. Thus, cooperative-bin-forward for this basic setup is indeed equivalent to partial-decode-forward. We 
will see however in the next section that when we have states, even though we still set R' « R, the relay will not be able to 
decode any part of the message so the binning aspect of the scheme will be instrumental. 

By symmetry, we can assume without loss of generality that the true messages and bin-indices corresponding to the current 
block are all 1, i.e. 

{Lb-i,MiM'') = (1,1,1). 

We bound the probability of decoding error in block b conditioned on successful decoding for blocks {B, B — 1,... ,b + 1}, 
averaged over the randomness in the messages and codebook generation. In particular, successful decoding in block 6-1-1 
means that Lb has been decoded successfully, where we remind ourselves that 

Lb = B\nbiZ^(l\l)). 

An error occurs in block 6 only if any of the following events occur; 

(а) M'(l,Lb)^l, 

(б) {Lb-i,M'f) ^ (1,1) given M'(l, L,) = 1. 

We analyze the two events in the following. One can notice from the above partitioning of the error events that we are 
ensuring that M'^(lb-i, Lb) is equal to 1 only for 4-i = Lb-i = 1, and not worrying about what Lb) is for any other 

value of 4-1. However, it is still important to fix at most one to), pair for each 4-i, even if it is arbitrary for all Ib-i 7 ^ 1, 
which is what the first step of the decoding does|^This allows us to restrict our attention to at most 2"^^+^ ) options while 
analyzing the probability of decoding Lb-i incorrectly during the second decoding step, instead of 2"^^+^ 4 

^Of course, one could also discard any Ib—i for which a unique Lb) cannot be identified; we stick to making one ai'bitraiy choice in such cases 

only because it makes the exposition simpler. 
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Event (a): M^(l,Lb) ^ 1; 

We have 

Pr^M^(l,Lft) l) = Pi'(Bint,(Zl^(m'i,ll)) = Lt for some toJ, > 1 ) 

= Pr(Binf,(Z^(m'f,|l)) = B\nb{Z^{l\l)) for some > 1) 

< ^ Pr(Bin,(Z,"K|l)) = Bin,(Z,”(l|l))) 

m'^>l 

= ^ Pr(Bin,(Z,"K|l)) = Bin,(Z,"(l|l)), Z,"K|1) = Z,"(l|l)) 

m'^>l 

+ ^ Pr(Bin,(Z,"K|l)) = Binfe(Zr(l|l)), Z^{m[\l) ^ Z^{1\1)) 

m'^>l 

< ^ Pr(Z"K|l) = Z,"(l|l)) 

m'^>l 

+ ^ Pr(Bin6(Z,"K|l)) = B\nb{Z^m) \ Z,"K|1) ^ Z,"(l|l)) 

(a) ~ 

^ qtiR' ^ —^(e)) _j_ ^ c^ — nR 

where 

- (i) follows since Lb = Bin;,(Z^(l|l)), 

- the first term in [ii) is obtained because for ^ 1, the codewords Z^{rn'^\l) and Z^(l|l) are generated independently 
according to Y\^^iPz\Xr{'\^rbi{lb-i)), and the second term in {ii) arises because the binning is performed uniformly at 
random and independently for each sequence, 

and we use ^(e) to denote any function of e for which i5(e) —?► 0 as e —?> 0. Hence, we get that 

Pr (^M'b{l, Lb) 7 ^ —>■ 0 as n —>• 00 , 

if the following two constraints are satisfied: 

R' < R, (7) 

R'< H{Z\Xr)-6{e). (8) 


Event (b): {Lb-i,Mb ) ^ (1,1) given M^(l,Lb) = 1; 

The probability of this event is upper bounded by 

Pr ^Condition (|^ is not satisfied by (Z{,_i, toJ() = (1,1) | Ml{l, Lb) = 1 j 

+ Pr I^Condition (|^ is satisfied for some (4 -i,to(() ^ (1,1) | M^(l,Lb) = 1^ . 

The first term goes to zero as n —> oo by the law of large numbers. The second term can be analyzed by standard applications 
of the packing lemma p8| as follows: 

Pr I^Condition (|^ is satisfied for some {Ib-i^rUb) ^ (1,1) | Mb{l^Lb) = 1^ 

< ^ Pr((x,",(i),zr(i|i),x,"(m;'|i,i),yn 


E 


Pr 


((X,",(4_i), Z’^{m'b{ib-i,Lb)\lb-i) , X^im''\ib-i,m'b{tb-i,Lb)) , n”) G 


^ 2nR" 2-n{I(X-Y\Xr,Z)-S{e)) 2n(R+R'')2-'^(I(^Xr,Z-,Y)-S{e)) 


which follows by applying the packing lemma. Thus, we get that 

Pr((Vi,Mn ^ (1,1) I lW'(l,Lb) = l) ^0, as n^oo. 


R” < I{X-,Y\Xr,Z)-S{e), 
R + R” < I{X,Xr;Y)-d{e). 


if 


(9) 

( 10 ) 
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Performing Fourier-Motzkin elimination of R, R' and R" using the rate constraints and also R — R' + R", letting n —?► oo, 
B ^ oo and e —0, we get that the rates specihed in Theorem are indeec^achievable by the scheme presented in this 
section. One can also simplify the Fourier-Motzkin elimination step by setting R to be R' + S{e). Since the converse part of 
this theorem is already known, we do not repeat the arguments here. ■ 

VI. Proof of Theorem[2] 

The achievability part of the theorem is similar to the scheme presented in the previous section. Due to the availability of 
causal state information at the source encoder and the decoder, the source encoder constructs codebooks for each state symbol 
and treats the state sequence as a time-sharing sequence (i.e. it performs multiplexing). Note that since the relay does not have 
state information, it might not be able to decode part of the message. However, it can still perform the bin-forward operation, 
allowing us to establish coherence between the source and the relay transmissions without sacrihcing unnecessarily on the 
rate. The decoding is also similar to the previous section, except for the extra demultiplexing component. The converse part 
of the theorem is presented towards the end of this section. Thus, we see that while partial-decode-forward cannot be applied, 
cooperative-bin-forward allows us to achieve the capacity region of the state-dependent semideterministic relay channel. 


Proof: 

Fix a pmf PXr{Xr)px\Xr-,s{x\Xn s) and e > 0. Split R as R' + R”, with the message M denoted accordingly as (M', M"). 
Divide the total communication time into B blocks, each of length n. 
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Fig. 7: The figure depicts the cribbed codewords generated for encoding m' for a given a:”(Z). Each node corresponds to a 2 : symbol that is generated 
independently according to Pz\x.r,s{'\^r,i(l), s). The red circles show how encoder 1 ch oose s the codeword if it wants to transmit m' = 2 and observes 
s" = (0, 0,1, 0,1,0). This construction is not identical but equivalent to that described in |28| Section 7.4.1]. 


Codebook Generation: 

For each block 6 S [1 : B], a codebook is generated independently of the other blocks as follows. 

- Cooperation codewords 

Generate 2”-^ codewords a;"j(4-i), i.i.d. according to pxr^ where 4-1 G [1 : 2"^]. 

- Cribbed codewords 

For each 4-i and each s G S, generate a codebook of 2”^ codewords. The jth symbol of such a codeword is chosen 
independently according to Pz\Xr,s{‘\^rbi{lb-i), s). The result of this is that for each R-i, each m'f, € [1 : 2”^ ] and each 
s" = {sbi, Sb 2 , ■ ■ ■, Sbn), the source encoder can form an effective codeword 2 ^(toJ,|4-i, ), whose ith symbol can be 

causally chosen as the ith symbol of the m^-th codeword from the codebook corresponding to Ib-i and Sbi- See Figure 

- Transmission codewords 

For each 4-i, each G [1 : 2”^ ] and each s G S, generate a codebook of 2"^ codewords. The ith symbol of 
such a codeword is generated independently according to px\x.r,z,s{'\xrbi{lb-i), Zbi{m'f^\lb-i, s), s). The result of this 
construction is that for each Ib-i, each G [1 : 2"^ ], each m'j,' G [1 : 2"^ ] and each , the source encoder can form 
an effective codeword ), whose ith symbol can be causally chosen as the ith symbol of the to" -th 

codeword from the codebook corresponding to R-i, m'^ and Sbi- 

- Binning 

Partition the set of all Z" into 2”^ bins, by choosing a bin for each z” independently and uniformly at random. Denote 
the index of the chosen bin for z" by binf,(z"). 

Encoding: 

Fix 4 = 1 and (m^jTOg) = (1,1)- Since the message in the last block is hxed, the effective rate of communication is 
i?, which can be made as close as desired to R by choosing a sufficiently large B. 

In block b, Ib-i is known to the source encoder. To communicate message nib = mlf), it transmits a;"(TO^'|4-i, toJ,, s"). 
The relay transmits Due to the deterministic link from source to relay and the codebook construction, the received 
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signal at the relay in block b is the codeword s'^). The source and the relay set to be the index of the bin 

containing 

From the encoding operation described above, we can see that the label 4 depends on (4_i, s^). We do not require 

the relay to decode but the source and the relay can still establish cooperation by directly performing a binning on the z'^ 
codeword to agree on the codeword to be used in the next block, thus providing the scheme with the title “cooperative- 
bin-forward”. The term cooperative is added to emphasize that the source and the relay agree on the binning and transmit 
coherently. Thus, the scheme achieves cooperation by communicating 4 to the relay, instead of m^. While the relay is not 
required to decode the partial message, we still need the destination to be able to decode all parts of the transmitted message 
successfully. In the following, appropriate conditions are imposed so that the destination can utilize the state information at its 
disposal to achieve successful decoding. 


Decoding: 

The decoder performs backward decoding, starting from block B and moving towards block 1, performing the following 
two steps for each block b: 

(1) Assuming that If, is known from previous operations, the decoder, for each 4-i G [1 : 2"^], finds the unique such 
that 

b\nb{Zb{mb\lb-i,Sb)) = lb. 


Whenever a unique mj, cannot be found for some Ib-i, the decoder chooses any m'j, arbitrarily. So after this operation, the 
decoder has chosen one for each 4-i, given its knowledge of lb and We will signify this explicitly by denoting 
the chosen message as 4)- 

(2) Now the decoder looks for the unique (4-i,mj) such that 


(x^biib-i), z^{m'^,{ib-l,s]^Jb)\ib-l,Sb), x]^{mb\ib-i,mbiib-i,Sb,lb),Sb), s^,yb'j G (11) 


Probability of Error: 

In the following error analysis, we will observe that in order to achieve the largest rate the scheme will set R' k, H{Z\Xr, S). 
The causal multiplexing-demultiplexing strategy proposed above effectively creates a different codebook for mj, for each typical 
sequence. The total number of zj^ codewords constructed by the source encoder is therefore « ■ R' « 

However, these codewords cannot be distinct since there are only « distinct sequences z^ (conditioned on 4-i)- 

This implies that multiple (s^, m'f) pairs will be mapped to the same codeword and therefore, the relay will be not be able 
to decode due to the lack of state information. 

By symmetry, we can assume without loss of generality that the true messages and bin-indices corresponding to the current 
block are all 1, i.e. 

= ( 1 , 1 , 1 ). 

We bound the probability of decoding error in block b conditioned on successful decoding for blocks {B, B — 1,... ,b + 1}, 
averaged over the randomness in the messages and codebook generation. In particular, successful decoding in block 6-1-1 
means that Lb has been decoded successfully, where we remind ourselves that 

Lb = B\nb{Z-{l\l,Sf)). 

An error occurs in block b only if any of the following events occur: 

(а) M'{l,S^,Lb)fl, 

(б) (Tfc-i, M") ^ (1,1) given M'(l, Sf, Lb) = 1. 

We analyze the two terms events in the following two subsections. 

Event [a): M^(l, 5”^, 7^ I-' We have 

Pr[M{,{l,Sb,Li^b) f l) = Pr(Bin{,(Z^(m(,|l,S'^)) = Lb for some m'b > 1) 

= Pv{B\nbiZ^im'b\l,S^)) = B\nb{Z^il\f Sf)) for some m'b > 1) 

< ^ Pr{B\nbiZfim'b\l,S^)) = B\nb{Z^{l\l,Sf))) 

= ^ Pr{B\nbiZfim'b\l,S^)) = B\nb{Z’fil\l,Sf )), Z^(m'b\l, Sf) = Zfil\l, S^)) 

+ ^ Pr{B\nb{Z^im'b\l,S^)) = B\nb{Z^{l\l, Sf)), Z^m'blfSf) f Zf{l\l,S^)) 
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< ^ Pr{Z^{m',\l,S^) = Z^{l\l,S^)) 

m'^>l 

+ J2 Pr{B\n,{Z-im',\l,S^)) = B\n,{Z-{l\l,S^))\Z-{m[\l,S^)^Z-{l\l,S^)) 

K>i 

^ ^nR' _ 2—n{H{Z\Xj.,S) — S{€)) _|_ quR! ^ q—uR 

where we use 6{e) to denote any function of e for which 5(e) —> 0 as e —?► 0. Hence, we get that 

Pr S'^, Lb) 0 as n —?► oo, 

if the following two constraints are satisfied; 

R' < R, (12) 

R' <H{Z\Xr,S)-5{e). (13) 

Event [b): (1) 1) M^(l, 5'^, Lb) = 1: The probability of this event is upper bounded by 

Pr ^Condition ( [TT] ) is not satisfied by (Zb_i, to(() = (1,1) | Mj(l, 5*^, Lb) = 1^ 

+ Pr ^Condition ( [TT] ) is satisfied for some ^ (1,1) | M^(l, Lf,) = . 

The first term goes to zero as n —> oo by the law of large numbers. The second term can be analyzed by standard applications 
of the packing lemma | |28| as follows: 

Pr ^Condition ( fTTj l is satisfied for some to(() ^ (1,1 ) | M^(l, SJ), Lb) = 1^ 

< ^ Pr(^{X:;bi^),Z^il\l,S^),X^im'^\l^,S^),S^,YbneR-^ 

+ ^ Pr 

^ 2nR"2-r>.(I(X-,Y\Xr-,Z,S)-S(e)) 2n(R+Ii")2-^iRXXr,Z;y\S)-S(e)) 


, Z^im'b{lb-i,S^, Lb)\ib-i,S ^), mU/Vi, 5,", 4), , S^, Y^) e 


which follows by applying the packing lemma. Note that when Ib-i > 1, it so happens due to the codebook construction that 
is independent of all the other sequences for any value of (m^, m'^). So the joint distribution of the sequences has the same 
factorization no matter what is chosen for Ib-i > 1. The only fact that matters for our analysis is that at most one m'^ has 
been chosen somehow for each Z^-i > 1- This allows us to write the fourth event as the union of at most 2"^^+^ '> events, 
where each corresponds to a different value of (4-i,m((). 

Thus, we get that 

Pr((4_i,Mn^(l,l)|M'(l,L,) = l) ^0, asn^cx), 


if 


R" <I(X-Y\Xr,Z,S)- 5 {e), (14) 

R + R” <I{X,Xr]Y\S)- 5 {e). (15) 

Performing Fourier-Motzkin elimination, letting n ^ oo, B ^ oo and e —0, we get that the rates specified in Theorem 
are indeed achieved by the achievability scheme presented in this section. 


Converse: 

Given a reliable code, we have by Fano’s inequality 

7T(M|y”,5")<ne„, 

where e„ —0 as n —oo. Then, we prove the first bound on R as follows: 
nR = H{M) 

= H{M\S^) 

= 

= H{Z^\S^) + H{M\Z'^,S'^) 

C) 

< H{Z^\S^) + I{M; 5”) + ncn 


13 


^ ^ I{M, X,-Y,\Y^-\Z^, 5") + ne„ 

i^l i^l 

n n 

Y, HiZ.lS'-^, Z^-\X,^,) + Y HM, X,; Y,\Y^-\Z^, S^, X,”) + ne„ 

i^l i^l 

(/) ^ ^ 

i^l i^l 

= nH{ZQ\SQ,XrQ, Q) + nI{XQ;YQ\ZQ, Sq, XrQ, Q) + ne^ 

< nH[ZQ\SQ,XrQ) Y nI{XQ-,YQ\ZQ,SQ,XrQ) +ne„, 

where 

- Q is a random variable uniformly distributed over [1 : n], independent of (X", X”, S'”, X"), 

- (a) follows because M is independent of S”, 

- (6) follows because X" is a function of M and S", 

- (c) follows by Fano’s inequality, 

- (d) follows by the chain rule of mutual information and because Xi is a function of (M, S”), 

- (e) follows because X^^i is a function of 

- (/) follows because conditioning reduces entropy and Y^ is independent of other random variables given (Xi, X^^i, Si), 
and 

- the final step follows because conditioning reduces entropy and Q — (Xq, X^q, Sq) — Yq. 

The second bound on R is proved as follows: 

nR = H{XI) 

= H{M\S^) 

</(M;F”|5”)+ne„ 

= /(M,X”,X;;X”|5”)+ne„ 

n 

< Y^i^i’^r,i]Yi\Si) +ne„ 

= TiI{Xq, XrQ', FqIiSq, Q) + n£n 

< nI{XQ,XrQ;YQ\SQ) +ne„. 

Thus, we have 

i? < min(/(XQ, X,Q; Xq I ), Tf (Zq I , X,Q) + J(Xq ; Xq I Zq , 5q , X,Q)) + e„. 

Note that 

- Sq is independent of Q and has marginal pmf ps due to the i.i.d. assumption on the state; 

- since Si is independent of Xr^i = Xr,i{Z'^~^) for all 1 < i < n, we have that Sq is independent of X^g; 

- we have that PYq\Xq,x,.q,Sq {y\x, Xr, s) is equal to PY\x,Xr-,s{y\x, Xr, s), since Yq is the output of the channel when the 
inputs are (Xg, X^q, S'q), and 

- similarly, we also have Zg = z{Xq, X^q, Sq). 

Hence the joint pmf of the random variables (Xg, X^g, Sq, Yq) factorizes as 

PSq,Xq,X,.q,Yq ( s , X, Xr, y) 

= PSQ{s)pXrQ{Xr)pXQ\XrQ,SQ{x\Xr,s)pYQ\XQ.XrQ,SQ{y\x,Xr,s) 

= Ps{s)pXrQ iXr)PXQ\XrQ,SQix\Xr,s)pY\X,Xr,siy\x,Xr,s). 

So, we can define the random variables X = Xg, Xr = XrQ, S = Sq, Z = Zq and X = Xg to get 

R < min{/(X, X,.; X|S'), H{Z\S, Xr) + /(X; X|Z, S, Xr)} + e„, 

where the pmf of the random variables has the foxmps{s)pXr{xr)Px\Xr,s{x\xr, s)pY\x,Xr,s{y\x, Xr, s) and Z = z{X, Xr, S). 
Since e„ —0 as n —oo, the converse is proved. 

This concludes the proof of Theorem ■ 


VII. Proof of Theorem[3] 

The achievability part of this theorem is obtained by combining the cooperative-bin-forward scheme from the previous 
section with instantaneous relaying. This requires an auxiliary random variable, as described next. 
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Proof: 

Fix pu{u)px\u,s(.x\'u, s), Xr = Xr{u,z) and e > 0. Split R as R' + R", with the message m denoted accordingly as 
M = {M', M"). Divide the total communication time into B blocks, each of length n. 

Codebook Generation: 

For each block 6 € [1 : B], a codebook is generated independently of the other blocks as follows. 

- Cooperation codewords 

Generate 2”-^ codewords i.i.d. according to pjj, where Ib-i G [1 : 2"^]. 

- Cribbed codewords 

For each 4-i and each s G S, generate a codebook of 2”^ codewords. The ith symbol of such a codeword is chosen 
independently according to Pz\u,si'\ubi{lb-i), s). The result of this is that for each Ib-i, each m'b € [1 : 2"^ ] and each 
Sb = (sbi, Sb 2 , ■ • •, Sbn), the source encoder can form an effective codeword 2 :^ Sb), whose ith symbol can be 
causally chosen as the ith symbol of the m'f,-th codeword from the codebook corresponding to 4-i and Sbi- 

- Transmission codewords 

For each Ib-i, each m'f, G [1 : 2"^ ] and each s G S, generate a codebook of 2"^ codewords. The ith symbol of such a 
codeword is generated independently according to Px\u,z,s{'\ubi{lb-i), Zbi{m'i^\lb-i, s), s). The result of this construction 
is that for each Ib-i, each mj, G [1 : 2"^ ], each G [1 : 2”^ ] and each the source encoder can form an effective 
codeword x'^{m'^\lb-i,rn'i^, ), whose ith symbol can be causally chosen as the zth symbol of the m" -th codeword from 

the codebook corresponding to Ib-i, and Sbi- 

- Binning 

Partition the set of all Z" into 2"^ bins, by choosing a bin for each 2 " independently and uniformly at random. Denote 
the index of the chosen bin for 2 ” by binf,( 2 :"). 


Encoding: 

Fix lo = 1 and — (I?!)- Since the message in the last block is fixed, the effective rate of communication is 

^^^R, which can be made as close as desired to R by choosing a sufficiently large B. 

In block b, assuming Z^-i is known to the source encoder, it transmits Xb{m'^\lb-i,mb, Sb)- The relay transmits x”},, the 
zth symbol of which is obtained as Xr{ubi{lb-i), Zbi{rn'ij\lb-i, s'^)). At the end of block b, the source and the relay set lb to 
be the index of the bin containing s^). 


Decoding: 

The decoding operation is nearly the same as that in the previous section. The decoder performs the following two steps 
for each block b, where b G {B, B — 1, ■ ■ ■ ,1}: 

(1) Assuming that lb is known from previous operations, the decoder, for each 4-i G [1 : 2"^], finds the unique such 
that 

b\nb{zf{mb\lb-i,s]^)) = lb. 


Whenever a unique mj, cannot be found for some Ib-i, the decoder chooses any arbitrarily. So after this operation, the 
decoder has chosen one r?z.^ for each Ib-i, given its knowledge of lb and We will signify this explicitly by denoting 
the chosen message as rhb{lb-i,Sb,lb). 

(2) Now the decoder looks for the unique {lb-i,rh'f) such that 


(zzn4-i), gT/"). (16) 


Probability of Error: 

By following a similar path as the previous section, we get the following conditions for vanishing probability of error; 

R' < R, 

R! < H{Z\U,S)-5{e), 

R!' < I{X-Y\U,Z,S)-5{e), 

RPR” < I{U,X-,Y\S)-5{e). 


Performing Fourier-Motzkin elimination completes the proof of achievability. 
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Converse: 

Given a reliable code, define for each 1 < f < n, the random variable Ui = Note that with this definition, 

Xr^i becomes a function of {Ui,Zi). We have for any reliable code, by Fano’s inequality, 

H{M\Y^,S^) < ncn- 

Then, 


nR = H{M\S^) 

= H{M, Z"|S'") 

< iJ(Z"|S'") +/(M;y”|S'”,Z") +ne„ 

n n 

< ^ Y,\Z^-\S^-\Z,, 5,) + ne„, 

i=l i=l 

where the final step uses the fact that Yi is independent of other random variables given Xi, Z'^~^, Zi, Si, since Xr^i is a 
function of Z^. Using the definition of Ui in the above, we get that 

n n 

nR<Y, H{Z,\U,, S,) + ^ I{Xf, Y,\U,,Z,, S,) + ne„ 

= nH{Z\UQ,SQ,Q) + nI{XQ-Y q\Uq, Zq, Sq, Q) + ne„, 


where Q is uniformly distributed over [1 : n] and independent of U'^,X'^,X'^, S'^,Y'^. 
The remaining bound on R is proved below; 


nR = H{M\S^) 

< /(M;y"|5")+ne„ 

n 

n 

^ I{M, X,,X,^,, 5") + nen 

n 

< Y, Xr,^, Z^-\S^-^-Y,\Si) + ne„ 

n 

= +ne„ 

n 

= YHU^,Xf,Y,\S,) + ner^ 

= nI{UQ,XQ;YQ\SQ,Q) + nen 

< nI{Q,UQ,XQ]YQ\SQ) + nen, 

where step (a) is true since {Xi, Xr^i, Z'^~^, is a function of {M,S'^), and step (b) follows because Xr,i is a function 

of {Ui,Zi), hence a function of {Ui, Xi, Si). 

Following similar arguments as the previous section, we can define U = {Q, Uq), X = Xq, X^ = X^q, S = Sq, Z = Zq 
and Y = Yq to get 

R < min{/(C/, X; FIS'), H{Z\U, S) + I{X; Y\U, Z, S')} + e„, 


where the pmf of the random variables has the form psis)pu{'U')px\u,s{x\u, s)pY\x,x,.,s{y\x, Xr, s), Z = z{X, S) and X^. = 
Xr{U, Z). Since > 0 as n —> 0, the converse is completed. The bound on cardinality of the auxiliary random variable can 
be obtained using arguments based on Caratheodory’s theorem as described in pS} Appendix C]. 

This concludes the proof of Theorem ■ 


VIII. Proof of Theorem|4] 

The achievability scheme is more intricate than the previous sections due to the additional complications in the model, but 
builds on the same idea. Each encoder in the multiple access channel has an operation similar to the source encoder of the 
relay channels considered in the previous sections. The source encoder of the relay channels controlled the signal received at 
the relay by employing rate-splitting and superposition coding. This signal was used to choose a cooperation codeword for 
the next block. For the multiple access channel, each encoder controls the received signal at the other encoder in the same 
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manner, so that at the end of a block, these two cribbed signals are known to both encoders, which are used to agree on a 
cooperation codeword for the next block. 

We point out the fact that it is crucial for both encoders to know both the cribbed signals at the end of a block, so that they 
can agree on a cooperation codeword for the next block. Encoder 1 knows zj, because it receives this signal. Since the model 
assumes that the cribbing link is of the form zi{Xi, Si), encoder 1 is able to control the z” signal received by encoder 2, 
and thus encoder 1 also knows z". Similarly, encoder 2 also knows zf and z^. If Zi were assumed to be zi{Xi,X 2 ,Si) or 
zi{Xi, X 2 , Si, S 2 ), then encoder 1 would not have knowledge of the received signal at encoder 2 due to the involvement of 
X 2 and S 2 , and it would not be possible to employ the scheme. The reason we are able to assume that the received signal at 
the relay in the previous sections is z{X, Xr, S) and not just z{X,S) is that the relay has no message of its own, so Xr in 
fact depends only on past signals transmitted by the source encoder, so the source encoder can still control the z" signal. 

Proof: 

Fix a pmf P£/(u)pxi|c/,Si(a^i|t‘> si)Px 2 |c/,S 2 (^ 2 |u, S 2 ) and e > 0. Split Ri as + R'l, with the message Mi denoted 
accordingly as and similarly split R 2 as R 2 + R 2 , with the message M 2 denoted accordingly as Divide 

the total communication time into B blocks, each of length n. In the achievability scheme proposed in 0 for the case of no 
state, M[ corresponds to the part of Mi that is decoded by encoder 2. As can be guessed based on the previous sections, this 
is not the case in the cooperative-bin-forward scheme presented below. 

Codebook Generation: 

For each block 6 S [1 : B], a codebook is generated independently of the other blocks as follows; 

- Cooperation codewords 

Generate codewords / 2 ,h-i), i-i-d. according topu, where h^-i S [1 : 2"^^] and ^ 2 . 6-1 G [1 : 2”^^]. 

In the following, we will sometimes abbreviate (Zi ;,_i, Z 2 , 6 -i) by h-i- 

- Cribbed codewords - I 

For each 4-i and each si G Si, generate a codebook of 2"^i codewords. The ith symbol of such a codeword is chosen 
independently according to Pzi|c/,Si (■|wbi(4-i)) Si)- The result of this is that for each ib-i, each m) G [1 : 2”'^i] and 
each = (sibi, sit, 2 , ■ • ■, sibn), encoder 1 can form an effective codeword z'^^{m\ b\^b-i, s"},), whose *th symbol can 
be causally chosen as the ith symbol of the m'^ f,-th codeword from the codebook corresponding to l^-i and sibi- 

- Cribbed codewords - II 

Similarly, for each Ib-i and each S 2 G ^ 2 , generate a codebook of 2"^2 codewords. The ith symbol of such a codeword is 
chosen independently according to Pz 2 \u,S 2 (■|t^b^(4-l)) 52 )- The result of this is that for each Ib-i, each ^ G [1 : 2 ”^=] 
and each = (■S 2 bij S 2 b 2 , ■ ■ • j S 2 bn), encoder 2 can form an effective codeword z: 2 b (’^2 bl^fc-i’ ^ 2 b)’ whose ith symbol 
can be causally chosen as the ith symbol of the f,-th codeword from the codebook corresponding to 4 -i and S 2 bi- 

- Transmission codewords - I 

For each Ib-i, each m\ ^ G [1 : 2"^i] and each si G Si, generate a codebook of 2"^i codewords. The ith symbol of 
such a codeword is generated independently according to Pxil/ 7 ,Zi,Si('l'>tbi(h-i), ^ibiitn'i bl^b-i, si), si). The result of 
this construction is that for each Ib-i, each m[ ^ G [1 : 2”^i], each G [1 : 2"^i ] and each s”^, encoder 1 can form an 
effective codeword x"i^(m[\jlb-i, tn'-^ bi^ib)’ whose ith symbol can be causally chosen as the ith symbol of the m[\-th 
codeword from the codebook corresponding to Ib-i, m[ ^ and sui- 

- Transmission codewords - II 

Similarly, for each Ib-i, each j, G [1 : 2 "^ 2 ] and each S 2 G S 2 , generate a codebook of 2"^2 codewords. The *th 
symbol of such a codeword is generated independently according to Px 2 \u,Z 2 ,S 2 i'\'‘^bii^b-i), Z 2 bi{'in 2 f,|Zb-i 5 S 2 ), 52 )- The 
result of this construction is that for each Ib-i, each m '2 G [1 : 2 "^ 2 ]^ each G [1 : 2 ”'^ 2 ] and each si^b, encoder 2 
can form an effective codeword b\h-i, m '2 S 25 ), whose jth symbol can be causally chosen as the jth symbol of 

the m 2 ft-th codeword from the codebook corresponding to Ib-i, m '2 b ■S 2 bi- 

- Binning 

Finally, partition the set Z" into 2”^i bins, by choosing a bin for each zf independently and uniformly at random from 
[1 : 2”^“^]. Denote the chosen bin for z" by bint,(z”). Similarly,partition the set Zf into 2”^^ bins, by choosing a bin 
for each Z 2 independently and uniformly at random from [1 : 2"^^]. Denote the chosen bin for Z 2 by binh(z 2 ). 

Encoding: 

Henceforth, whenever convenient, we will abbreviate {li^b-i,h,b-i) by h-i- Set {lip,l 2 ,i) = (1,1) and 

m'I s, m'2^B, rn'ls) = ( 1 , 1 , 1 , 1 ). 

Since the message in the last block is fixed, the effective rate of communication will be ^^^R 2 ), which can be 

made as close as desired to {Ri, R 2 ) by choosing a sufficiently large B. We now describe the encoding for block h. Assume 
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(u^{ib-l), zr6(w'l,6(4-l,S?b,^l.b) |4-l,S?h), Z^f,{m 2 ^f,{tb-l, S^^,l 2 ,b) \ h-l, S 2 b), 

Xibi^'lb I 4-1, mi,fa(4-i, s"b, Zi,b), s"b), x’^bim'lb I 4-i, m2,(,(4-i, s^b, h,b), s^b), s^f,, S 2 b, ^b ) G 7^^"^ (17) 


both encoders have agreed upon some 4-i = (^i,b-i, ^ 2 ,b-i) based on operations in previous blocks. Then, encoders encode 
messages mi^b and m 2 ,b by {,!?&_i, ^ib) ^ 2 bi ''^2 bl^b-hni^ {,, s^f,) respectively. This operation is valid because 

it does not require noncausal knowledge of the state sequences, it can be done “on the fly”. At the end of block b, both encoders 
have knowledge of the cribbed codewords Zib('m-[ bl4-i,Sib) and Z 2 b (^2 bl^b-ij ^^b)- They set 

h^b = b\nb{zu), and h^b = binb(z^t,). 


Decoding: 

The decoder performs backward decoding. For each block b G {B, B — 1, B — 2,..., 2}, assuming that lb = {li,b, h,b) is 
known from previous operations; 

(1) The decoder first takes a pass through all 4-i = ((i,b-i, ^ 2 ,b-i) and for each 4-i, finds the unique {m\ ^,^2 b) 
that 

bin6(z"b(TO4bl^b-i, Sib)) = Zi.b and bin6(4t,(m2_6|4-i, S 2 b)) = h,b- 

Whenever a unique b) cannot be found for some 4-i, the decoder chooses any {rn'i ^^^2 b) arbitrarily. So 

after this operation, the decoder has chosen one {m[ j) for each 4-i, given its knowledge of {lb, s”f,, s^b)- We will 
signify this explicitly by denoting the chosen messages as Th[ b(4-i, 4,b) and b(4-i, s^b, 4,b) respectively. 

(2) Now the decoder looks for the unique {lb-i,m” bii ^2 b) ^ttch that ([T7|) (appearing at the top of this page) is satisfied. 


Analysis of the Error Probability: 

By symmetry, we can assume without loss of generality that the true messages and bin-indices corresponding to the current 
block are all 1, i.e. 

(Lb-i, M'lb, M'lb) = (1,1,1,1,1). 

We bound the probability of decoding error in block b conditioned on successful decoding for blocks {B, B — 1,... ,b + 1}, 
averaged over the randomness in the messages and codebook generation. In particular, successful decoding in block 6-1-1 
means that (Ti,b, T 2 ,b) has been decoded successfully, where we remind ourselves that 


Li,b = Binb(^rb(l|l, S^b)) and L 2 ,b = Binb(Z 2 ”b(l|l, S^b))- 
An error occurs in block 6 only if any of the following events occur: 

(а) M(^,(l,^4,Li,b)y^l 

(б) M'_,(l,,52"b,L2.b)7^1 

(c) (Lb-i,M"b,M" J ^ (1,1,1) given (M( ,(1, ,54, Li,b), M'b(l, 52”b> T 2 .b)) = (1,1) 

We analyze each of the above three events in the following. 

Event {a): M[ f,(l, S'";,, Ti,b) ^ I-' We have 


Pr(M(^,(l,S4,Li,b)^l) 

= Pr (Binb(2'4(mi ft|l, S4)) = Li^b for some m[ b > l) 

= Pr(Binb(Z4(m;,b|l,5rb)) = Binb(Z4(l|l, S^)) for some ml,, > l) 

< ^ Pr(Binb(Z4(K,b|l,54)) = Binb(Z4(l|l,S4))) 


= ^ Pr(Binb(Zrb(m;,b|l,5i"b)) = Binb(Z4(l|l,S4)), Z4(ml,b|l, ^^b) = ^i"b(l|l, ^i^b)) 


+ ^ Pr(Binb(Z4(m;,b|l,S4)) 

m[h>t 


Binb(Z4(l|l,S4)), ZM,b\hS^b) ^ zm^s^b)) 


Pr iZUm[,b\h S^b) = ZUMh S^b)) 


< 



18 


+ ^ Pr(Bin,(ZrfeK,,|l,5i"b)) = Bm,iZUl\l,S^,)) \ 5^,) ^ ZUl\l,S^,)) 

™i,b>i 

^ 2^nB.[ _ 2^ — n{H(Zi\U,Si) — S{e)) _j_ 2^nR[ ^ c^ — nRx 

where we use 6 {e) to denote any function of e for which 5(e) —> 0 as e —?► 0. Hence, we get that 

Pr Sii^, Li^h) 7 ^ —> 0 as n —7 cxd, 

if the following two constraints are satisfied; 

R[ < Ri, 

R[<H{Z,\U,Si)-Sie). 

Event (b): ^(1, L 2 ,&) 7 ^ 1-' Similar to the previous subsection, we can conclude that 

Pr rM 2 ,h(l) ^ 2 ,b) 7^ l) —7 0 as n —>■ oo, 


if the following two constraints are satisfied; 


i?2 ^ -^2? 

R'^<H{Z2\U,S2)-6ie). 


Event (c): (Lb_i, J ^ (1,1,1) given (M( ,,(1, S'J),, Li^b), j,(l, 5'^^, L 2 ,b)) = (1,1)- The probability of this 

event is upper bounded by 

Pr (^Condition ([^ is not satisfied by (4_i, m" f,, m 2 ^f,) = (1,1,1) | (I'Tlb)!,'S'lb, Ti,b), M 2 ,b(l, S'Jt,, T 2 ,h)) = (1,1)) 

+ Pr (^Condition ([^ is satisfied for some (4_i, m",,, t) 7 I (1,1,1) | (M( ^(1, 5"^, Li.b),-M^^.b)!,T 2 .b)) = (1,1)) ■ 
The first term goes to zero as n —7 c» by the law of large numbers. 

The second term can be handled by considering the following four different cases separately and applying the pack¬ 
ing lemma appropriately in each case. 

- (Tb-i, M"„ M" J = ( 1 , 1 , > 1 ) given (M(_,(l, M',,(l, ^"b, T 2 .b)) = ( 1 , 1 ) 

- (Lb-i, = ( 1 , > 1 , 1 ) given (M{ ,( 1 , 5^,, Li.b), M^,b(l, ^ 2 ”b, L 2 ,b)) = ( 1 , 1 ) 

- (Lb-i, Mfb, M^,) = (1, >1, >1) given {M[ ,(1, Li.^), M' ,(1, L^^b)) = (1,1) 

- (Lf,_i,M"„M"J = (>!,*,*) 

A standard application of the packing lemma gives us that the probability of each of the first three events goes to zero as 
n —00 if the following constraints are respectively satisfied; 

i?" </(X 2 ; r|C/, ^ 2 , ^ 1 , ^ 1 , ^ 2 ) - 5(e), 
i?" < /(Xi ■,Y\U,Zi,X 2 , 81 , 82 )- 5(e), 

R'l + i?" < I{Xi,X 2 -, Y\U, Z^,Z 2 , 8 i, 82 ) - 5(e). 


Applying the packing lemma gives us the following condition for vanishing probability of the fourth event, 

R 1 +R 2 Y R!i + i ?2 < I{U, Z^,Z 2 , Xi,X 2 -, r|^i, 82 ) - 5(e) 


= I{X^,X2■,Y\8^,82)-5{e). 

Collecting all the constraints established so far, we have 

Ri < i?i, (18) 

<fT(Zi|f/,5i)-5(e), (19) 

i?2 < 1?2, (20) 

<7J(Z2|f/,52)-5(e), (21) 

R!i <I{X 2 -,Y\U,Z 2 ,X^, 81 , 82 ) - 5{e), (22) 

R'l <I{Xi,Y\U,Zi,X 2 , 8 i, 82 )- 5{e), (23) 

i?'/ + A-"</(Xi,X2;y|(7,Zi,Z2,51,^2)-5(e), (24) 

i?i-f i?2 + i?i +-R 2 </(2fi,A:2;y|S'i,5'2)-5(e). (25) 


Performing Fourier-Motzkin elimination of i?i, R 2 , i?(, R' 2 , R'l and R'^, and letting n —> 00 , B —7 c» and e —7 0, we get that 
communication at arbitrarily small error probability is possible for the rates specified in Theorem]^ 
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Converse: 

The proof of the converse can be constructed by using similar arguments as in Q. Note that we have by Fano’s inequality 
the following condition for any reliable code: 

H{Mi,M2\Y'^,S^,S^)<nen, 


where e„ —>■ 0 as n —oo. Define Ui as 

TT. A 7*-l c*-! C't-l'i 

— \^Zj- y ^ ^2 ^ 1 5 *^2 / 

An upper bound on Ri is established by the following: 

nRi = H{Mi) 

HiMi\M 2 ,S^,S^) 

H{Mi,Z^\M2,S^,S^) 

= H{Z' 1 \M 2 , S^, S^) + H{Mi\Z^, M 2 , S^, S^) 

< H{Z^\M2, S^, S^) + /(Mi; y"|zr, M2, 5 i", S^) + ne„ 

n n 

= ^ H{Zu\ZI-\M2, S^, S^) + Y, I{Mi-Y,\Y^-\Z^, M2, S’^, S^) + ne„ 


Z=1 


2=1 


^ H{Zu\ZI-\M 2 , S^, S^, zr^) + Y /(Mi; Z”, M2, X^) + ne. 


2=1 


2=1 


^ HiZu\Zl-\M 2 , S^, S^, Zr^) + Y Y,\Y^-\Z^, M 2 , S^, S^, X") + ne„ 


2=1 


2=1 


(/) 


'< ^ H{Zu\Zl-\Sl,Sl-\Z^^-^) + ^ I{Xu; Y,\Zl,SlSl X2i, Z^^) + ne„ 


2=1 


2=1 


= ^ H{Zu\U„ Su)+Y I{X^^\Y,\U^, Zu, Su, S 2 ^,X 2 ^) + ne„ 


2=1 


2 = 1 


= nH{ZiQ\UQ,SiQ,Q) +nI{XiQ-,YQ\UQ,ZiQ,SiQ,S 2 Q,X 2 Q,Q) + ne„ 


where 


- Q is a random variable uniformly distributed on [1 : n], independent of other random variables, 

- (a) follows because Mi is independent of (M 2 , S'", ), 

- (5) follows since Z^ is a function of {Mi, M 2 , 81 , 82 ), 

- (c) follows by Fano’s inequality, 

- {d) follows since (X 2 i,^ 2 i) is a function of {M 2 , 82 , Zl~^), 

- (e) follows since Xi^ is a function of (Mi, S", X^”^), 

- (/) follows since (i) conditioning reduces entropy and (ii) conditioned on (Xu, X 2 i, Sii, S 2 i), Yi is independent of 
(Mi,M2,Xi",X2",Si",S2"). 

similarly, we get an upper bound on R 2 : 

n/?2 < nH{Z2Q\UQ, 82Q, Q) + nI{X2Q', Yq\Uq, Z2Q, 8iq, 82Q,Xiq,Q) + ne„. 

Applying similar arguments to the sum rate, we get: 
n(/?i + /?2) =//(Mi,M2) 

= //(Mi,M2|Si",S2”) 

= H{Mi,M2 ,Z'1,Z1^\8^,8^) 

= H{Z'l, Zq\ 8 ^, S2") + //(Ml, M2IZ1", Z2”, ^ 1 , ^2 ) 

n 

< Y H{Zu, Z 2 ,\U,, 8 u, S2.) + //(Ml, M2IZ1", Z^, Sr, S2") 

2=1 

n 

< Y H{Zu, Z2,\U,,8 u, 82 i) + /(Ml, M 2 ; y”|Zi", Z 2 ”, 8 ^, 8^) + ne„ 

2=1 

n n 

< 'Y^ ^(Zii, Z 2 i\Ui, 8 ii, 82 i) + "Y^ I{Xii, X 2 i] Yi\Ui, Zii, Z 2 i, 8 ii, 82 i) + ne„ 

i=l i=l 

= nH{ZiQ, Z2 q\Uq, 8iq, 82Q, Q) + nI{XiQ, X2 q;Yq\Uq, Ziq, Z2Q,8iq, 82Q,Q) + ncn- 
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Another upper bound on the sum rate can be easily established as follows: 


n{Ri+R 2 ) = H{Mi,M 2 ) 

= H{MuM 2 \S^,S^) 

<IiAR,M2;Y^\S^,S^)+nen 

n 

< X2i', Yi\Sii, S2i) + nCn 

i=l 

= nI{XiQ,X2Q;YQ\SiQ,S2Q,Q) + ne„ 

< nI(XiQ, X2 q;Yq\Siq, S2q) + ne„. 


We note the following four conditions: 

- {SiQ, S2 q) is independent of {Q, Uq), and has the pmf S2), 

- XiQ - (Q, Uq,Siq) - {Q, Uq, S2q) - X2Q, 

■ PYQ|t/Q,2fiQ,X2Q,SiQ,S2Q(yl'“>^l>^2,Si,S2) is equal to py|jCi,X2,Si,S2 ( 2 /|^li 2:2, Si, S2), 

- ZiQ = zi{XiQ, Siq) and Z2Q = Z2{X2Q, S2q)- 

The proof of the first condition follows because {Si^i,S 2 ,i) are generated independently of Ui = {Zl~^ , Z 2 ~^ , Sl~^ , S 2 ~^) 
for all 1 < z < n, and due to the i.i.d. assumption on (Si, 82)- To prove that the second condition is satisfied, consider the 
following. For any 1 < z < rzj^ 


p{s\,sl,Xli,X2^,zl 

mi ,7712 


= E 

Till ,7712 


p{mi)p(m 2 )pis\, S 2 )pixu\mi, s\, Z 2 


{^P{zij\mi, sP^, zi 


X p(x2i\m2, s\, z\ 


l^^piz 2 j\m 2 ,s( 2 ,z{ 


^ =p(s\,S2)'^p{mi)p{xu,zl ^\mi,s\,zl ^)'^p(m2)p(x2^, Z 2 ^1^2, S 2 ,4 

TTli 7712 

= p{s\,sl)p(xu, zl~^\s\, zl~^)p(x2i, zl~^\sl, 


= p[s\, s\)p(xu\s\ 

, 4 - 

1 

5 ^2 

■ )p{4 

|Sl.4 )pix2i, 

4"'1^2 

i, ^i 

4- 


The above factorization implies that 









Xu- 



\ 2 ( 2-4 

— (X2i, S2i), 

for 

all 1 

< i 

: < n 

^ Xu - 

(U., 

Su) 

— i.P^2i, 

52.), 

for 

all 1 

< i 

i < n 

^ Xu - 

{U^, 

Sli) 

— (Ui, S2i, X2i), 

for 

all 1 

< i 

i < n 


XiQ - {Uq,Siq,Q) - (Uq,S2Q,X2Q,Q). 


Similarly, we also have 

X2Q - (Uq, S2Q,Q) - (Uq,Siq, XiQ, Q). 

These two Markov chains together imply the desired Markov chain Xiq — (Uq, Siq,Q) — (Uq, S2Q,Q) — X2q. The third 
condition follows by the dehnition of the auxiliary random variable Uq and because Yq is the channel output when the 
inputs are Xig, Ar 2 Q, Siq, S' 2 Q. Similarly, the fourth condition is true because Ziq and Z 2 Q are the cribbed signals due to 
(A:iq,S'iq) and (X 2 Q,S 2 q) respectively. 

So we can dehne random variables U = (Q, Uq), Xi = Xiq, X2 = X2Q, Si = Siq, S2 = S2Q, and Y = Yq, such that 
the joint pmf of these random variables has the factorization 

PSuS2{Sl,S2)pu{u)px^\U,S^ {Xl\u, Si)px^\U,S2i^2\u, S2)PY\X^,X2,S^,S2{y\^l^ ^2, 81,82), 


^We drop subscripts denoting the random variables when analyzing the factorization to reduce the length of the expressions. 

^The expressions from this line onwards, read in a literal manner, might seem strange for representing factorizations due to the presence of terms of 
the form Z 2 ~^)p{z 2 ~^\-, however these are valid factorizations, possibly containing redundant conditioning in some terms. The Markovity 

conclusions we draw from such a form are valid, because the factorization might at most contain redundant conditionings. An example might make the point 
clearer. Assume that a joint factorization is of the form p{x, y)p{x',y^)p{z\x, y')p(z'\x', y). If we marginalize by summing over {x, x') first and then over 
(z^z'), we get an expression of the form p{y\y')p{y'\y), while reversing the order of summation gives us p(y)p{y')- Thus, Y and Y' are independent, so 
the former expression is equal to the latter, though the former contains redundant conditioning. 
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and Z\ = zi{Xi^Si) and Z2 = 2:2(-^2, < 5 ' 2 ). Noting that e„ —?► 0 as n — >■ 00, the four constraints on the rates that we have 
established become 


Ri < I{Xi-,Y\U, X2, Z^, 81,82) + H{Z^\U, 5 i), 

i?2 < I{X2-, Y\U, Xi,Z2, 81,82) + H{Z2\U, 82), 

R1YR281 I{Xi,X2', Y\U,Zi,Z2,8i,82) Y H{Zi,Z2\U,8x,82), 
Ri+R 2 <I{Xi,X 2 -,Y\ 8 i, 82 ). 


Thus, we get that the region stated in Theorem|^is an outer bound to the achievable rate region. The bound on cardinality of the 
auxiliary random variable can be obtained using arguments based on Caratheodory’s theorem as described in |28 Appendix C]. 
This concludes the proof of Theorem ■ 


IX. Proof of Theorem[5] 

The achievability part of this theorem builds on the cooperative-bin-forward scheme from the previous section by combining 
it with instantaneous relaying (a.k.a. codetrees or Shannon strategies). To avoid unnecessary repetition, we only provide the 
differences in the achievability part relative to that in the previous section. 

Proof: 

Fix a pmf Pu{u)PXi\u,Si{xi\'^i S\)px2\u,S2,Zi{x2\u, S2, zi) and e > 0 . Rate-splitting is performed as in the previous section. 

Codebook Generation: 

The cooperation codewords and the codebooks used by Encoder 1 are generated in the same manner as the previous section. 
Encoder 2 generates codebooks by treating the causally observed zi symbol in the same manner as the causally observed S2 
symbol. More precisely, the codebooks constructed by Encoder 2 are described in the following two paragraphs. 

Eor each Ib-i, each S2 S S2 and each Zi G Zi, generate a codebook of codewords. The ith symbol of such a 

codeword is chosen independently according to Pz2\u,S2,Zi{'\ubi{lb-i), S2, zi). The result of this is that for each Ib-i, each 
f, G [1 : 2”^2], each each encoder 2 can form an effective codeword Z2b(™2 ^2b^ ^ib)’ whose zth symbol 

can be causally chosen as the ith symbol of the ^-th codeword from the codebook corresponding to Ib-i, S2bi and Zi^bt- 

Eor each h-i, each m2 ^ G [1 : 2 '^^^], each S2 G 82 and each zi G Zi, generate a codebook of 2 "^= codewords. The *th 
symbol of such a codeword is generated independently according to Px2\u,Z2,S2,Zi {'lubiilb-i), Z2bi(w2 S2, Zi), S2, zi). 

The result of this construction is that for each Ib-i, each m^ S [1 : 2"^2]^ each m^'f, G [1 : 2"^2]^ each and each 
encoder 2 can form an effective codeword X2{,(m2 ^14-1) s^},, z"^), whose *th symbol can be causally chosen as the *th 

symbol of the m^'f,-th codeword from the codebook corresponding to h-i, rn'^ j, S2bi and zi_bi- 

The binning is performed as in the previous section. 

Encoding: 

The encoding at Encoder 1 is identical to that in the previous section. Encoder 2 transmits a; 2 (,(m 2 \| 4 -i, z"j,) 

which can be chosen as described above. 

Decoding: 

The decoder performs backward decoding over the blocks, where it performs two steps as in the previous section, with the 
first step changed to the following. 

The decoder first takes a pass through all 4-i = (4,b-i, 4,b-i) and for each 4-i, finds the unique {m\ ^,m 2 f) such that 
binb(z 4 (mi ,,14-1, Sib)) = h,b and binb(z 2 b(TO 2 ,b| 4 -i, S 2 b, z”fc(mi J/b-i, 3 ”^))) = 4.b- 


Probability of Error: 

In the previous section, we obtained the conditions ([T 8 ]l-(| 25 ]l for vanishing probability of error. The only difference now is 
that the fourth condition is replaced by 

R'^<H{Z 2 \U, 82 ,Zi)- 6 ie). 

This is obtained by analyzing the probability of event ( 6 ) conditioned on the complement of event (a). The other conditions 
remain the same. Performing Eourier-Motzkin elimination of Ri, R2, R[, R2, Ri and R2, and letting n —i' 00 , B —i' 00 and 
e —>^ 0 , we get that communication at arbitrarily small error probability is possible for the rates specihed in Theorem 
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Converse: 

The only difference in the converse compares to that of the previous section is that we need to show a different bound on 
i ?2 and we need to prove the factorization of the pmf. The bound on Ri and the two bounds on the sum rate Ri + R 2 are the 
same and require no changes in the arguments. 

The new bound on R 2 can be shown by following the same line of arguments with minor changes. We provide the chain of in¬ 
equalities below for completeness. The auxiliary random variable Ui appearing below is defined to be 


ni?2 = H{M2) 

H{M 2 \M,,S'^,S^) 

HiM 2 ,Z^\M^,S'^,S^) 

= H{Z^\Mi,S^,S^)+H{M2\Z^,Mi,S'^,S^) 

< S^) + /(M2; y"|Z2", Ml, S^) + ne„ 

n n 

= ^ M(Z 2 ,|zr\ Ml, 5 i", ^2”) + 51 ^(^2; Z2", Ml, 5 r, ^2”) + ntr, 


H{Z2^\Zif\M,,S^, Zj) + 5 ] /(M2; Z^, Mi, S^, xn + ne„ 


M(Z2,|zr\ Ml, Zl) + Y X2^; Y,\Y^-\Z^, Mi, X”) + ne„ 


if) 


'< YHiZ2^\Z^2~\Sl,Sl-\Zl) + YHX2^;Y,\ZlS{,Sl,Xu,Zl-^)+ner, 


= Y H{Z2^\U,, S2^, Zu) + Y ^(^2*; Y,\Ui, Z2., 5 i„ S2^, Xh) + ne„ 

= nH{Z2Q\UQ,S2Q,ZiQ,Q) +nI{X2Q-,YQ\UQ,Z2Q,SiQ,S2Q,XiQ,Q) + ne„ 


where 

- Q is a random variable uniformly distributed on [1 : n], independent of other random variables, 

- (a) follows because M 2 is independent of (Mi, S'", ), 

- (b) follows since Z 2 is a function of (Mi, M 2 , S", SJ), 

- (c) follows by Fano’s inequality, 

- (d) follows since (X^jZ^) is a function of (Mi, S", 

- (e) follows since X2i is a function of {M2, S2 , XI), 

- (/) follows since (i) conditioning reduces entropy and (ii) conditioned on (Xij, X2i, Sii, S2i), Yi is independent of 
(Mi,M 2 ,Xi",X 2 ",Si",S 2 "). 

Regarding the joint pmf, we note the following conditions 

- (SiQ, S2q) is independent of {Q, Uq), and has the pmf psi.Sa (si, si), 

- XiQ - {Q,Uq,Siq) - S 2 Q, 

- X2Q - {Q,Uq,S 2 Q,Ziq) - {Xiq,Siq), 

- VYQ\UQ,x^Q,x^Q,S^Q,S^Q{y\u,xi,X2,si,S2) is equal to 0:2, si, S2), 

- ZiQ = zi{XiQ, Siq) and Z2Q = Z2{X2Q, S2q)- 

The first, fourth and fifth conditions do not need new arguments. The second and third condition can be proved as follows. 
For any 1 < z < n, 

p{Si, S2, Xii, Z-^ , Z2 ) 

mi ,m 2 

i -1 i -1 

= Y Pi^i)Pi'^2)pis\, S2)p{xu\mi, s\, Z2~^)Y[p{zij\'mi, s{, z^2~^)Y[piz2j\m2, si, z{) 

mi,m2 j —1 j — 1 

= p{s\, S^pixii, 2j"^|s*i, z{). 

This implies Xiq — (t/g, 5 'ig, Q) — S2Q- For the third condition, we have for any 1 < i < n, 



23 


= ^ p{mi,m 2 ,s\,sl,xu,X 2 i,zl ^,zu,zl 


= E 


p{mi)p{m2)pis\,sl)p{xu\mi,s\,Z2 ( ]^p(zij|mi, sj, 

i=i 


X p{x2^\m2,S2,zl) 

= P{s\, S 2 )'^p{mi)p{xu, z\\mi, s\, zl~^) '^p{m2)p{x2i, zl~^\m2, S 2 ,z\) 

mi m 2 

= Pis\,S2)p{Xii, Zl\s\, zl~^)p{X2i, zl~'^\S2,zl) 


The above factorization implies that 

^2, - {Sl,Zl,Z^2~^) - (Xu, Su), for all 1 < i < n 

^ X2i - (Ui, S2i, Zu) - (Xu, Su), for all 1 < i < n 

^ X 2 Q - iUQ,S2Q,ZiQ,Q) - {Xiq,Siq). 

So we can define random variables U = (Q, Uq), Xi = Xiq, X2 = X2Q, Si = S'lg, S '2 — S2Q, and Y = Yq, such that 
the joint pmf of these random variables has the factorization 


PSi,S2isi, S 2 )Pu(u)PXi\U,Si{xi\u, Si)px^\U,S2,Zi(x2\u, S 2 , Zi)pY\Xi,X2,Si,S2iy\^l^ ^ 2 , Si, S2), 


and Zi = zi(Xi,Si) and Z 2 = Z 2 {X 2 , S 2 )- Noting that e„ —?► 0 as n —>■ 00 , the four constraints on the rates that we have 
established become 


Ri < I(XuY\U, X2,Zi,SuS2) + H{Zi\U, Si), 
R 2 <I{X 2 -,Y\U,Xi,Z 2 ,Si,S 2 ) + H(Z 2 \U,S 2 ,Zi), 
Ri+R2<I{Xi,X2-,Y\U,Zi,Z2,Si,S2)+H{Zi,Z2\U,Si,S2), 
Ri+R 2 <I{Xi,X 2 -,Y\Si,S 2 ). 


Thus, we get that the region stated in Theoremj^is an outer bound to the achievable rate region. The bound on cardinality of the 
auxiliary random variable can be obtained using arguments based on Caratheodory’s theorem as described in |28 Appendix C]. 
This concludes the proof of Theorem ■ 


X. Concluding Remarks and Some Open Problems 

We presented the cooperative-bin-forward scheme and showed that it achieves the capacity region in a variety of semide- 
terministic setups. While partial-decode-forward has been the scheme of interest in semideterministic setups, we demonstrated 
the strict advantages of cooperative-bin-forward by considering state-dependent setups, where partial-decode-forward cannot 
be applied, but cooperative-bin-forward is optimal. 

A number of interesting questions remain. Most importantly, how can the cooperative-bin-forward scheme be extended to, e.g. 
the model in Figure when the source-relay link is not deterministic, but a general noisy link? Cooperative-bin-forward was 
developed in this paper as an alternative to partial-decode-forward. However, the latter has an advantage of extending naturally 
to the noisy case due to the decoding operation at the relay. The crucial high-level ingredient for establishing cooperation that is 
used by both schemes is that different nodes agree on some information. In partial-decode-forward, the agreement is established 
via a decoding operation at the relay. In cooperative-bin-forward, the decoding operation was removed and the agreement was 
established by exploiting the deterministic components in the models. To ensure some kind of agreement between nodes in 
the general noisy case without using a decoding operation, the similarity between the operations of binning and compression 
suggest an approach. Note that binning is a form of compression, so a natural extension of cooperative-bin-forward might 
involve a compression operation at the relay, where part of the compression can be reconstructed at the source, thus enabling 
some cooperation between the source and the relay. 

Another interesting question is that of designing optimal achievability schemes for all the state-dependent setups considered 
in this paper when the state is known only to the source encoders, causally or strictly causally. Finally, the semideterministic 
relay channel with two state components, one known to the source and the other to the relay, with an uninformed destination, 
is also an interesting open question. 
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